    Position Models and Language Modeling

    In statistical language modelling the classic model is the n-gram. This model is not able, however, to capture long-term dependencies, i.e. dependencies spanning more than n symbols. An alternative is the probabilistic automaton. Unfortunately, preliminary experiments show that this model is not yet competitive for language modelling, partly because it tries to model dependencies that are too long. We propose to improve the use of this model by restricting the dependency length to a more reasonable value. Experiments show a 45% reduction in perplexity on the Wall Street Journal language modelling task.
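
    As a rough illustration of the baseline being improved on, below is a minimal bigram (n = 2) language model with a perplexity computation. This is a generic sketch for context, not the authors' setup; the add-one smoothing and all names are assumptions.

    import math
    from collections import Counter

    def train_bigram(sentences):
        """Count unigrams and bigrams over a tokenised corpus (illustrative)."""
        unigrams, bigrams = Counter(), Counter()
        for tokens in sentences:
            padded = ["<s>"] + tokens + ["</s>"]
            unigrams.update(padded[:-1])
            bigrams.update(zip(padded[:-1], padded[1:]))
        return unigrams, bigrams

    def perplexity(sentences, unigrams, bigrams, vocab_size):
        """exp(-mean log-probability per token), with add-one smoothing."""
        log_prob, n_tokens = 0.0, 0
        for tokens in sentences:
            padded = ["<s>"] + tokens + ["</s>"]
            for prev, cur in zip(padded[:-1], padded[1:]):
                p = (bigrams[(prev, cur)] + 1) / (unigrams[prev] + vocab_size)
                log_prob += math.log(p)
                n_tokens += 1
        return math.exp(-log_prob / n_tokens)

    A lower perplexity means the model assigns higher probability to held-out text, which is the sense in which the 45% reduction above is an improvement.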

    A Discriminative Model of Stochastic Edit Distance in the form of a Conditional Transducer

    Many real-world applications, such as spell-checking or DNA analysis, use the Levenshtein edit distance to compute similarities between strings. In practice, the costs of the primitive edit operations (insertion, deletion and substitution of symbols) are generally hand-tuned. In this paper, we propose an algorithm to learn these costs. The underlying model is a probabilistic transducer, computed using grammatical inference techniques, which allows us to learn both the structure and the probabilities of the model. Beyond the fact that the learned transducers are neither deterministic nor stochastic in the standard terminology, they are conditional, and thus independent of the distributions of the input strings. Finally, we show through experiments that our method allows us to design cost functions that depend on the string context where the edit operations are used. In other words, we obtain a kind of context-sensitive edit distance.
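
    For context, the hand-tuned setting the paper moves away from is the classical weighted Levenshtein distance, sketched below; the uniform default costs are arbitrary placeholders, precisely the quantities the paper proposes to learn instead.

    def weighted_edit_distance(s, t, ins_cost=1.0, del_cost=1.0, sub_cost=1.0):
        """Levenshtein distance with hand-tuned costs for the primitive operations."""
        m, n = len(s), len(t)
        # d[i][j] = cheapest way of editing s[:i] into t[:j]
        d = [[0.0] * (n + 1) for _ in range(m + 1)]
        for i in range(1, m + 1):
            d[i][0] = i * del_cost
        for j in range(1, n + 1):
            d[0][j] = j * ins_cost
        for i in range(1, m + 1):
            for j in range(1, n + 1):
                match = 0.0 if s[i - 1] == t[j - 1] else sub_cost
                d[i][j] = min(d[i - 1][j] + del_cost,    # delete s[i-1]
                              d[i][j - 1] + ins_cost,    # insert t[j-1]
                              d[i - 1][j - 1] + match)   # substitute or match
        return d[m][n]

    A context-sensitive edit distance, as learned by the paper's conditional transducer, would replace these three constants with costs that depend on the surrounding symbols.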

    A note on conformal symmetry in projective superspace

    We describe a sufficient condition for actions constructed in projective superspace to possess an SU(2) R-symmetry. We check directly that this condition implies that the corresponding hyperkähler varieties, constructed by means of the generalized Legendre transform, have a Swann bundle structure.

    Efficient Pruning of Probabilistic Automata

    Applications of probabilistic grammatical inference are limited by time and space constraints. In statistical language modelling, for example, the large corpora now available lead to automata with millions of states. We propose in this article a method for pruning automata (when restricted to tree-based structures) that is not only efficient (sub-quadratic) but also dramatically reduces the size of the automaton with only a small impact on the underlying distribution. Results are evaluated on a language modelling task.
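
    A minimal sketch of the general idea, under the assumption of a threshold criterion on empirical probability mass (the paper's actual pruning criterion may differ): cut every subtree of the prefix tree whose mass falls below a threshold, in a single traversal.

    class Node:
        """A state of a tree-shaped (prefix-tree) probabilistic automaton."""
        def __init__(self, count=0):
            self.count = count     # number of training strings reaching this state
            self.children = {}     # symbol -> Node

    def prune(node, total, threshold):
        """Drop subtrees whose empirical probability mass is below `threshold`.

        Visits each state once, so the cost is linear in the automaton size.
        """
        node.children = {
            sym: prune(child, total, threshold)
            for sym, child in node.children.items()
            if child.count / total >= threshold
        }
        return node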

    Use of Grammatical Inference in Natural Speech Recognition

    This paper presents the application of stochastic grammatical inference to speech recognition. In speech recognition, the acoustic signal is processed into a set of words that are combined to build sentences. Language models are then used to guide the speech recognition application toward the most pertinent combination. Up to now, statistical language models have been used. We suggest using stochastic formal grammars instead, built by machine learning algorithms. We first show that unaided grammatical inference cannot be used for speech recognition. We then show that smoothing is necessary and demonstrate the gain that can be obtained with a basic smoothing. We finally put forward a smoothing technique dedicated to stochastic formal grammars.
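
    The abstract does not spell out the basic smoothing; a common baseline, sketched under that assumption, is linear interpolation: mix the grammar's estimate with a backoff distribution so that no word sequence receives zero probability.

    def interpolate(p_grammar, p_backoff, lam=0.9):
        """Linear-interpolation smoothing of a probability estimate."""
        return lam * p_grammar + (1 - lam) * p_backoff

    # A sentence the inferred grammar cannot parse gets probability 0.0;
    # after interpolation with a uniform backoff over a 10,000-word
    # vocabulary it keeps a small non-zero probability.
    p = interpolate(0.0, 1.0 / 10_000)   # 1e-05

    Without some such smoothing, a single out-of-grammar sentence drives the recogniser's score to zero, which is one reason unaided grammatical inference fails on real speech.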

    Probabilistic Finite-State Machines – Part II

    Probabilistic finite-state machines are used today in a variety of areas of pattern recognition, and in fields to which pattern recognition is linked. In Part I of this paper, we surveyed these objects and studied their properties. In this Part II, we study the relations between probabilistic finite-state automata and other well-known string-generating devices such as hidden Markov models and n-grams, and provide theorems, algorithms and properties that represent the current state of the art for these objects.
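
    As a concrete reminder of the object under study, here is a minimal probabilistic finite-state automaton scoring a string by the standard forward computation; the two-state machine in the example is made up for illustration.

    from collections import defaultdict

    def string_probability(transitions, initial, final, string):
        """Probability of `string` under a PFA, summed over all accepting paths.

        transitions: (state, symbol) -> list of (next_state, probability)
        initial:     state -> initial probability
        final:       state -> stopping probability
        """
        # forward[q] = probability of having read the prefix and being in state q
        forward = dict(initial)
        for symbol in string:
            nxt = defaultdict(float)
            for state, p in forward.items():
                for state2, tp in transitions.get((state, symbol), []):
                    nxt[state2] += p * tp
            forward = nxt
        return sum(p * final.get(q, 0.0) for q, p in forward.items())

    # A two-state PFA over {a, b} (made-up probabilities):
    transitions = {(0, "a"): [(0, 0.5)], (0, "b"): [(1, 0.3)], (1, "b"): [(1, 0.4)]}
    prob = string_probability(transitions, {0: 1.0}, {0: 0.2, 1: 0.6}, "ab")  # 0.09

    The same forward computation is what relates PFAs to hidden Markov models: an HMM's forward algorithm has exactly this shape, with emissions in place of labelled transitions.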